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Abstract. Band-power estimates of cosmic microwave 
background fluctuations are now routinely used to place 
constraints on cosmological parameters. For this to be 
done in a rigorous fashion, the full likelihood function of 
band-power estimates must be employed. Even for Gaus- 
sian theories, this likelihood function is not itself Gaus- 
sian, for the simple reason that band-powers measure the 
variance of the random sky fluctuations. In the context of 
Gaussian sky fluctuations, we use an ideal situation to mo- 
tivate a general form for the full likelihood function from a 
given experiment. This form contains only two free param- 
eters, which can be determined if the 68% and 95% confi- 
dence intervals of the true likelihood function are known. 
The ansatz works remarkably well when compared to the 
complete likelihood function for a number of experiments. 
For application of this kind of approach, we suggest that 
in the future both 68% and 95% (and perhaps also the 
99.7%) confidence intervals be given when reporting ex- 
perimental results. 

Key words: cosmic microwave background - Cosmology: 
observations - Cosmology: theory 



1. Introduction 

Six years after their first detection by the COBE satel- 
lite (Smoot et al. 1992), it is now well appreciated that 
cosmic microwave background (CMB) temperature fluc- 
tuations contain rich information concerning virtually all 
the fundamental cosmological parameters of the Big Bang 
model (Bond et al. 1994; Knox 1995; Jungman et al. 1996). 
New observations from a variety of experiments, ground- 
based and balloon-borne, as well as the two planned satel- 
lite missions, MAPQ and Planck Surveyor^, are and will 
be supplying a constant stream of ever more precise data 
over the next decade. 



It is in fact already possible to extract interesting 
information from the existing data set, consisting of al- 
most 20 different experimental results (Lineweaver et al. 
1997; Bartlett et al. 1998a,b; Bond & Jaffe 1998; Efs- 
tathiou et al. 1998; Hancock et al. 1998; Lahav & Bridle 
1998; Lineweaver & Barbosa 1998a, b; Lineweaver 1998; 
Webster et al. 1998; Lasenby et al. 1999). These exper- 
imental results are most often given in the literature as 
power estimates within a band defined over a restricted 
range of spherical harmonic orders. Our compilation, sim- 
ilar to those of Lineweaver et al. (1997) and Hancock et 
al. (1998), is shown in Figure 1 and may be accessed at 
our web site^j. The band is defined either directly by the 
observing strategy, or during the data analysis, e.g., the 
electronic differencing scheme introduced by Netterficld 
et al. (1997). This permits a concise representation of a 
set of observations, reducing a large number of pixel val- 
ues to only a few band-power estimates, and for this rea- 
son the procedure has been referred to as "radical com- 
pression" (Bond et al. 1998). If the sky fluctuations are 
Gaussian, as predicted by inflationary models, then little 
or nothing has been lost by the reduction to band-powers 
(Tegmark 1997). This is extremely important, because the 
limiting factor in statistical analysis of the next generation 
of experiments, such as, e.g., BOOMERanGQ, MAXIMA^, 
and Archeops[| is calculation time. Working with a much 
smaller number of band-powers, instead of the original 
pixel values, will be essential for such large data sets. The 
question then becomes how to correctly treat the statis- 
tical problem of parameter constraints starting directly 
with band-power estimates. 

Standard approaches to parameter determination, 
whether they be frequentist or Bayesian, begin with the 
construction of a likelihood function. For Gaussian fluctu- 
ations, the only kind we consider here, this is a multivari- 
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Fig. 1. Present CMB power spectrum estimates. Flat 
band-powers are shown as a function of multipole 
order I. The data and references can be found at 



well known, this amounts to a likelihood approach for ob- 
servables with a Gaussian probability distribution. Band- 
power estimates do not fall into this category (Knox 1995; 
Bartlett et al. 1998c; Bond et al. 1998; Wandelt et al. 1998) 
- they are not Gaussian distributed variables, not even in 
the case of underlying Gaussian temperature fluctuations. 
The reason is clear: power estimates represent the variance 
of Gaussian distributed pixel values (the sky temperature 
fluctuations), and they therefore have a distribution more 
closely related to the ^-distribution. 

We begin, in the following section, by a general dis- 
cussion of the likelihood approach applied to CMB ob- 
servations. In the context of an ideally simple situation, 
we find the exact analytic form for the likelihood func- 
tion of a band-power estimate. Reflections concerning the 
likelihood function in the context defined by actual exper- 
iments motivates us to propose this analytic form as an 
approximation, or ansatz, in the more general case. It is 
extremely easy to use, requiring little information in order 
to be applied to an experimental setup, because it contains 
only two adjustable parameters. These can be completely 
determined if one is given two confidence intervals, say 
the 68% and 95% confidence intervals, of the true, under- 
lying likelihood distribution (notice that here we see the 
non-Gaussian nature of the likelihood - a Gaussian func- 
tion would only require one confidence interval, besides 



tittp: //astro. u-strasbg.f r/Obs/COSMO/CMB/tabcmb.htmtlLe best power estimate, to be completely determined). 

We ask that in the future at least two confidence inter- 
vals be given when reporting experimental band-power 
estimates (more would be better, say for adjusting more 
complicated functional forms). An important limitation 
of the approach is the inability at present to account for 
more than one, correlated band-powers, as will be dis- 
cussed further below. 



The solid curve is a flat CDM model with = 0.4, 
A = 0.6, H a = 40 km/s/Mpc, Q = 19 ^K, fl b h 2 = 0.006 
and n = 0.94, while the dotted line represents an open 
model with Q = 0.2, A = 0, H Q = 60 km/s/Mpc, 
Q = 20 /iK, £lbh 2 = 0.015 and n = 1 (no gravitational 
waves and no reionization). 



ant Gaussian in the pixel temperature values, where the 
covariance matrix is a function of the model parameters 
(see below). The likelihood is then used as a function of 
the parameters, but as just mentioned, the large number 
of pixels makes this object very computationally cumber- 
some. It would be extremely useful to be able to define a 
likelihood function starting directly with the power esti- 
mates in Figure 1. This is the concern of this paper, where 
we develop an approximation to the the full likelihood 
function which requires only band-power estimates and 
very limited experimental details. As always in such proce- 
dures, it is worth emphasizing that the likelihood function, 
and therefore all derived constraints, only applies within 
the context of the particular model adopted. In our dis- 
cussion, we shall focus primarily on inflationary scenarios, 
whose theoretical predictions have become easily calcula- 
ble thanks to the development of fast Boltzmann codes, 
such as CMBFAST (Seljak & Zaldarriaga 1996; Zaldar- 
riaga et al. 1998). 

Much of the recent work on parameter determination 
has relied on the traditional x 2- fitting technique. As is 



We quantitatively test the accuracy of the approxima- 
tion in Section 3 by comparison to several experiments for 
which we have calculated the full likelihood function. The 
approximation works remarkably well, and it can repre- 
sent a substantial improvement over both single and "2- 
winged" Gaussian forms commonly used in standard x 2 ~ 
analyses; and it is as easy to use as the latter. The pro- 
posed likelihood approximation, the main result of this 
paper, is given in Eqs. Jl8| ) - (po|). We plan to maintain a 
web page^J with a table of the best fit parameters required 
for its use. Detailed application of the approximate likeli- 
hood function to parameter constraints and to tests of the 
Gaussianity of the observed fluctuations is left to future 
papers. Other, similar work has been performed by Bond 
et al. (1998) and Wandelt et al. (1998). 



2. Likelihood Method 
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2.1. Generalities 

Temperature anisotropics are described by a 2— 
dimensional random field A(n) = (ST/T)(h), where h is 
a unit vector on the sphere. This means we imagine that 
the temperature at each point has been randomly selected 
from an underlying probability distribution, characteristic 
of the mechanism generating the perturbations (e.g., In- 
flation), ft is convenient to expand the field in spherical 
harmonics: 

A(n) = 22 ai m Yi m (h) (1) 

1 m 

For Inflation generated perturbations, the coefficients 
aim are Gaussian random variables with zero mean - 

< dim >ens— - and covariance 

< Q-lraa*l' m i >ens= ClSu>S mm ' (2) 

This latter equation defines the power spectrum as the set 
of Ci. The indicated averages are to be taken over the 
theoretical ensemble of all possible anisotropy fields, of 
which our observed CMB sky is but one realization. Since 
the harmonic coefficients are Gaussian variables and the 
expansion is linear, it is clear that the temperature values 
on the sky are also Gaussian, and they therefore follow a 
multivariate Gaussian distribution (with an uncountably 
infinite number of variables, one for each position on the 
sky) . The covariance of temperatures separated by an an- 
gle 6 on the sky is given by the correlation function 

C{d) =< A(fti)A(na) > ens = ^ ^(21 + l ) C i p i^) ( 3 ) 

i 

where Pi is the Legendre polynomial of order / and /i = 
cos 9 = hi ■ fi2- The form of this equation, which follows 
directly from Eq. (||), is dictated by the statistical isotropy 
of the perturbations - the two-point correlation function 
can only depend on separation. 

Observationally, one works with sky brightness inte- 
grated over the experimental beam 

A b (h p ) = J dilA(h)B(h pi h) (4) 

where B is the beam profile and h p gives the position of 
the beam axis. The beam profile may or may not be a 
sole function of h p ■ h, i.e., of the separation between sky 
point and beam axis; if it is, then this equation is a simple 
convolution on the sphere, and we may write 

C b {9) =< A b ( ni )A b (h 2 ) > ens = -L Y (21 + l)Q (5) 

Air ^-^ 

i 

x\Bi\ 2 Pi(fi) 

for the beam-smeared correlation function, or covariance 
between experimental beams separated by 9. The beam 
harmonic coefficients, are defined by 

B(9') = ±J2(2l + l)B l P l (n>) (6) 
i 



3 

with h p ■ h = cos 9' = // . For example, for a Gaussian 
beam, B{9) = \j (2^a 2 )e~^ 1^ and Bi = e -^+i)^ 2 /2. 

Given these relations and a CMB map, it is now 
straightforward to construct the likelihood function, 
whose role is to relate the N P - 1X observed sky tempera- 
tures, which we arrange in a data vector with elements 
di = A b (hi), to the model parameters, represented by a 
parameter vector O . As advertised, for Gaussian fluctua- 
tions (with Gaussian noise) this is simply a multivariate 
Gaussian: 

£(3) . Prob(^) = (2 , )j v pte 1 /2|c|1/2 «-* T - C ~ 1 ^(7) 

The first equality reminds us that the likelihood function 
is the probability of obtaining the data vector given the 
model as defined by its set of parameters. In this expres- 
sion, C is the pixel covariance matrix: 

Cij d^dj ^ ens ^ij ^ij (^) 

where the expectation value is understood to be over the 
theoretical ensemble of all possible universes realisable 
with the same parameter vector. The second equality sep- 
arates the model's pixel covariance, T, from the noise in- 
duced covariance, N. According to Eq. (j^), = C b (9ij). 
The parameters may be either the individual Ci (or band- 
powers, discussed below), or the fundamental cosmological 
constants, fl, H a , etc... In the former case, Eq. (||) shows 
how the parameters enter the likelihood; in the latter sit- 
uation, the parameter dependence enters through detailed 
relations of the kind Ci [ & ] , specified by the adopted model 
(e.g., Inflation). Notice that if one only desires to deter- 
mine the Ci, then only the assumption of Gaussianity is 
required. 

Many experiments report temperature differences; and 
even if the starting point is a true map, one may wish 
to subject it to a linear transformation in order to de- 
fine bands in Z-space over which power estimates are to 
be given. Thus, it is useful to generalize our approach to 
arbitrary homogeneous, linear data combinations, repre- 
sented by a transformation matrix A: It' = A ■ ~d*. since 
the transformation is linear, the new data vector retains a 
multivariate Gaussian distribution (with zero mean), but 
with a modified covariance matrix: C = A ■ C ■ A*. As a 
consequence, the transformed pixels, ~d , may be treated 
in the same manner as the originals, and so we will here- 
after use the term generalized pixels to refer to the ele- 
ments of a general data vector which may be either real 
sky pixels or some transformed version thereof. The ele- 
ments of the new theory covariance matrix are (using the 
summation convention) 

T[j = A im A jn T mn = 1-^(21 + \)dW l3 (I) (9) 

where Wij(l) = A im A jn Pi(iJ lmn )\Bi\ 2 . The window func- 
tion is usually defined as Wu(l), i.e., the diagonal elements 
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of a more general matrix W(l). Normally, one tries to find 
a transformation which leads to a strongly diagonal W(Z) 
and diagonal noise matrix (see comment below). 

An example is helpful. Consider a simple, single differ- 
ence Arfj// = Afi(ni) — Ab(fi2), whose variance is given 
by < Afotf >ens= 2[C 6 (0) - C b (0)}. This may be written 
in terms of multipoles as 

< A% ff >= i- + m {2|s ; i 2 [i - PiO*)]} (io) 

l 

identifying the diagonal elements of W as the expression 
in curly brackets. Notice that the power in this variance 
is localized in Z-space, being bounded towards large I by 
the beam smearing and towards small I by the difference. 
The off-diagonal elements of C depend on the relative 
positions and orientations of the differences on the sky; 
in general these elements are not expressible as simple 
Legendre series. 

Band-powers are defined via Eq. (||). One reduces the 
set of Ci contained within the window to a single num- 
ber by adopting a spectral form. The so-called flat band- 
power, <5Tfb, is established by using Cj = 2ir(STfb) 2 /[Z(Z + 
1)], leading to 

In this fashion, we may write Eq. (Q) in terms of the band- 
power and treat the latter as a parameter to be estimated. 
This then becomes the band-power likelihood function, 
£(6T{b). One obtains the points shown in Figure 1 by 
maximizing this likelihood function; the errors are typi- 
cally found by in a Bayesian manner, by integration over 
C with a uniform prior. Notice that the variance due to 
the finite sample size (i.e., the sample variance, but also 
known as cosmic variance when one has full sky coverage) 
is fully incorporated into the analysis - the likelihood func- 
tion "knows" how many pixels there are. 

An important remark at this stage concerns the con- 
struction of Figure 1. We see here that this figure is only 
valid for Gaussian perturbations, because it relies on Eq. 
(0), which assumes Gaussianity at the outset. If the sky 
fluctuations are non-Gaussian, then these estimates must 
all be re-evaluated based on the true nature of the sky 
fluctuations, i.e., the likelihood function in Eq. (]?]) must be 
redefined. The same comment applies to any experiment 
which has an important non-Gaussian noise component 
- the likelihood function must incorporate this aspect in 
order to properly yield the power estimate and associated 
error bars. 

What is the raison d'etre for these band powers? The 
likelihood function is clearly greatly simplified if we can 
find a transformation A which diagonalizes C (signal plus 
noise). This can be done for a given model, but because C 
depends on the model parameters, there is in general no 
unique such transformation valid for all parameter values. 



The one exception is for an ideal experiment (no noise, or 
uniform, uncorrelated noise) with full-sky coverage - in 
this case the spherical harmonic transformation is guaran- 
teed, by Eq. (|^) , to diagonalize C for any and all values of 
the model parameters. This linear transformation is repre- 
sented by a matrix Aij = Yi(fij), where i = Z 2 + Z + rn+lis 
a unidimensional index for the pair (Z, m). It is the role of 
band-powers to approximately diagonalize the covariance 
matrix in more realistic situations, where sky coverage is 
always limited and noise is never uniform (and sometimes 
correlated) , and in such a way as to concentrate the power 
estimates in as narrow bands as possible. Since this is not 
possible for arbitrary parameter values, in practice one 
adopts a fiducial model (particular values for the param- 
eters) to define a transformation A which compromises 
between the desires for narrow and independent bands 
(Bond 1995, Tegmark et al. 1997, Tegmark 1997, Bunn & 
White 1997). 

2.2. Motivating an Ansatz 

Given a set of band-powers, how should one proceed to 
constrain the fundamental cosmological parameters, de- 
noted in this subsection by O? If we had an expression 
for £(<$Tfb), for our set of band-powers 5Tfb, then we could 
write C(SJa,) = Prob(t\SJ&) = Prob(lt\5T^\€>]) = 
C(Q). Thus, our problem is reduced to finding an expres- 
sion for £(<5Tfb), but as we have seen, this is a compli- 
cated function of <5Tfb, requiring use of all the measured 
pixel values and the full covariance matrix with noise - 
the very thing we are trying to avoid. Our task then is to 
find an approximation for C(STfb). In order to better un- 
derstand the general form expected for £(8Tfb), we shall 
proceed by first considering a simple situation in which we 
may find an exact analytic expression for this function. We 
are guided by the observation that the covariance matrix 
may always be diagonalized around an adopted fiducial 
model. Although this remains strictly applicable only for 
this model, we imagine that the likelihood function could 
be approximated as a simple product of one-dimensional 
Gaussians near this point in parameter space. If we fur- 
ther suppose that the diagonal elements of the covariance 
matrix (its eigenvalues) are all identical, we can find a very 
manageable analytic expression for the likelihood in terms 
of the best power estimate. We will then pose this general 
form as an ansatz for more realistic situations, one which 
we shall test in the following section. We return to these 
remarks after developing the ansatz. 

Consider, then, a situation in which the band temper- 
atures (that is, generalized pixels which are the elements 

of the general data vector a ) are independent random 
variables (C is diagonal) and that the experimental noise 
is spatially uncorrelated and uniform: 

c ij = (°m + ct jv)<% ( 12 ) 
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where o~ 2 M is the model-predicted variance and a% is the 
constant noise variance. For simplicity, we assume that 
all diagonal elements of W are the same, implying that 
a 2 M is a constant, independent of i. We discuss shortly the 
nature of such a data vector in actual observational set- 
ups. This situation is identical to one where iV P i X values 
are randomly selected from a single parent distribution 
described by a Gaussian of zero mean and variance a 2 M + 
o 2 N . The band-power we wish to estimate is proportional 
to the model-predicted variance according to (i.e., Eq. [ll]) 

* 2 m = 5Tl x \ £ ^±^ Wii (l) = 6TlK band (13) 

(independent of i), and we know that in this situation the 
maximum likelihood estimator for the model predicted 
variance is simply 



1 ^ 2 2 
^ i— 1 

as follows from maximizing the likelihood function 



(14) 



1 



[2n(a 



N 



N pIX /2 



Notice that this is a function of <jm > which peaks at the 
best estimate <tm, and whose form is specified by the pa- 
rameters <7m, o~ n and iVpix- To obtain the likelihood func- 
tion for the band-power, we simply treat this as a func- 
tion of STfb, using Eq. (|l3|), parameterized by STr,, <7/v 



and N„ 



C(STtb) 



[2n{ST 2 h TZ band + a 2 N )] N r 



c/2 



(15) 



xe 



= G(5Tfb; STfb, crjv, Ap ix ) 

It clearly peaks at JTfb. Thus, in this ideal case, we have a 
simple band-power likelihood function, with correspond- 
ing best estimator, STa,, given by Eq. (|l4|). 

Although not immediately relevant to our present 
goals, it is all the same instructive to consider the dis- 
tribution of 5Tfb. This is most easily done by noting that 
the quantity 



E 



(16) 



M 



N 



is x 2 ~distributed with N p - lx degrees of freedom. We may 
express the maximum likelihood estimator for the band- 
power in terms of this quantity as 



ST, 



I'b 



n, 



band 



'n) v 2 



>N 



(17) 



From < x% ■ > = ^Ypixj we see immediately that the esti- 
mator is unbiased 

Its variance is 



Var(«fb 



K 



' M 



2 \2 



band 



N 2 

pix 



-Var(x 



2U7 2 .(a 

band \ 



^■2 ^2 



pix 



explicitly demonstrating the influence of sample/cosmic 
variance (related to iVpi x ). 

All the above relations are exact for the adopted sit- 
uation - Eq. ( p"5| ) is the complete likelihood function for 
the band-power defined by the generalized pixels satisfy- 
ing Eq. Such a situation could be practically real- 
ized on the sky by observing well separated generalized 
pixels to the same noise level; for example, a set of dou- 
ble differences scattered about the sky, all with the same 
signal-to-noise. This is rarely the case, however, as scan- 
ning strategies must be concentrated within a relatively 
small area of sky (one makes maps!). This creates impor- 
tant off-diagonal elements in the theory covariance ma- 
trix T, representing correlations between nearby pixels 
due to long wavelength perturbation modes. In addition, 
the noise level is quite often not uniform and sometimes 
even correlated, adding off-diagonal elements to the noise 
covariance matrix. Thus, the simple form proposed in Eq. 
( |l2| ) is never achieved in actual observations. Nevertheless, 
as mentioned, even in this case one could adopt a fiducial 
theoretical model and find a transformation A which di- 
agonalizes the full covariance matrix C, thereby regain- 
ing one important simplifying property of the above ideal 
situation. The diagonal elements of the matrix are then 
its eigenvalues. Because of the correlations in the original 
matrix, we expect there to be fewer significant eigenval- 
ues than generalized pixels; this will be relevant shortly. 
One could then work with a reduced matrix consisting 
of only the significant eigenvalues, an approach reminis- 
cent of the signal-to-noise eigenmodes proposed by Bond 
(1995), and also known as the Karhunen-Loeve transform 
(Bunn & White 1997, Tegmark et al. 1997). There remain 
two technical difficulties: the covariance matrix does not 
remain diagonal as we move away from the adopted fidu- 
cial model by varying <5Tfb - only when this band-power 
corresponds to the fiducial model is the matrix really diag- 
onal. The second complicating factor is that the eigenval- 
ues are not identical, which greatly simplified the previous 
calculation. 

All of this motivates us to examine the possibility 
that a likelihood function of the form ( |l5| ) could be 
applied, with appropriate redefinitions of N piK and o~n. 
We therefore proceed by renaming these latter v and 
(3, respectively, and treating them as parameters to be 
adjusted to best fit the full likelihood function. Thus, 
given an actual band-power estimate, ST^l (i.e. 



an 
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experimental result), we propose G{5T^\ 8T^ , 0, v 
an ansatz for the banc 
parameters v and (3: 



-power likelihood function, with 



(18) 



X[STq,] = 



(°)l2 
fb J 



P 2 



We have only two parameters - v and /3 — to determine 
in order to apply the ansatz. This can be done if two 
confidence intervals of the complete likelihood function 
are known in advance. For example, suppose we were given 
both the 68% (er^ & u^g) and 95% (crj~ 5 & ct^~ 5 ) confidence 
intervals; then we could fix the two parameters with the 
equations 



0.68 



0.95 



r °° d[ST {h ] C(5T & ) 



d[5T & ] £(<STfb) 



/ °° d[JTfb] £(<5Tfb) 



(19) 



(20) 



We shall see in the next section (Figures 2-7) that this 
produces excellent approximations. This is the main result 
of this paper. 

Unfortunately, most of the time only the 68% confi- 
dence interval is reported along with an experimental re- 
sult (we hope that in the future authors will in fact sup- 
ply at least two confidence intervals). Is there any way to 
proceed in this case? For example, one could try to ju- 
diciously choose v and then adjust (3 with Eq. (|19|). The 
most obvious choice for v would be v = N pix , although 
from our previous discussion, we expect this to be an up- 
per limit to the number of significant dcgrces-of-freedom 
(the significant eigenvalues of C), due to correlations be- 
tween pixels. The comparisons we are about to make in 
the following section show that a smaller number of effec- 
tive pixels (i.e., value for v) is in fact required for a good 
fit to the true likelihood function. One could try other 
games, such as setting v = (scan length)/ (beam FWHM) 
for unidimensional scans. This also seems reasonable, and 
certainly this number is less than or equal to the actual 
number of pixels in the data set, but we have found that 
this does not always work satisfactorily. The availability 
of a second confidence interval permits both parameters, 
v and /?, to be unambiguously determined and in such a 
way as to provide the best possible approximation with 
the proposed ansatz. 

Bond et al. (1998) have recently examined the nature 
of the likelihood function and discussed two possible ap- 
proximations. The form of the ansatz just presented is 
in fact identical to one of their proposed approximations, 



parameterized by x and G. These parameters are simply 
related to our v and (3 as follows: x — (3 2 and G = v. 

Notice that the above development and motivation for 
the ansatz essentially follow for a single band-power. A 
set of uncorrelated power estimates is then easily treated 
by simple multiplication. However, the approximation as 
proposed does not simultaneously account for several cor- 
related band-powers, and it's accuracy is therefore limited 
by the extent to which such inter-band correlations are 
important in a given data set. As a further remark along 
these lines, we have noted that flat-band estimates of any 
kind, be it from a complete likelihood analysis or not, do 
not always contain all relevant experimental information, 
(Douspis et al. 2000); any method based on their use is 
then fundamentally limited by nature of the lost informa- 
tion. 

The only way to test the ansatz is, of course, by direct 
comparison to the full likelihood function calculated for 
a number of experiments. If it appears to work for a few 
such cases, then we may hope that it's general application 
is justified. We now turn to this issue. 



3. Testing the approximation 

In order to quantitatively test the proposed ansatz, we 
have calculated the complete likelihood function for sev- 
eral experiments. Our aim will be to compare the true 
likelihoods to the approximation. Figures 2-5 summarize 
our comparisons with the Saskatoon and MAX data sets. 
For the Saskatoon and MAX experiments, we compare 
the approximation directly to the band-power likelihood 
functions. In all cases, the complete likelihood functions 
have been calculated as outlined in Section 2 above. 

The first comparison will be made to the Saskatoon Q- 
band 1995 4-point and 10-point differences (experimen- 
tal information can be found in Netterfield et al. 1997; 
all relevant information concerning the experiment can be 
found on the group's web pageQ ; for useful and detailed in- 
formation on a number of experiments, see Caltech's web 
page^) . This particular choice of window functions was ar- 
bitrary. The approximation, applied using the constraints 
( |l9| ) and (^0|), is shown in Figures 2 and 3 as the dashed 
(red) curve. We see that it provides a good representation 
of the complete likelihood functions, traced by the solid 
(black) curves in each figure; in fact, the fit is truly spec- 
tacular for the 10-point difference. Taking as a benchmark 
the rule-of-thumb that 1, 2 and 3 a confidence intervals 
may be estimated by 2 A In C = 1,4 and 9, respectively, we 
sec that the approximation reproduces almost perfectly all 
of these, and more. 

Consider now setting v = N p i x = 24 and 48, for the 
4-point and 10-point differences, respectively, and then 
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Fig. 2. Comparison to the Saskatoon Q-band 1995 4-point difference. The value of the likelihood is plotted as a 
function of the band-power, STr,, in both linear (left) and logrithmic (right) scales. The solid (black) curve in each 
case gives the true likelihood function, while the dashed (red) curve corresponds to the proposed approximation based 
on two confidence intervals. The dot-dashed (blue) curve is the ansatz with v = N p - lx = 24 and adjusted to the 68% 
confidence interval (see text). A "2-winged Gaussian" with different positive-going and negative-going errors is shown 
as the three-dotted-dashed (green) curve. All curves have been normalized to unity at their peaks. 
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Fig. 3. Comparison to the Saskatoon Q-band 1995 10-point difference. The line-styles are the same as in the previous 
figure; here JV p j x = 48 for the dot-dashed (blue) line. 
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Fig. 4. Comparison to the MAX ID likelihood function. This is the combined likelihood for the 3 frequency channels, 
3.5, 6 and 9 cm -1 . Linestyles are the same as in the previous two figures, and here N p i K = 21 applies to the dot-dashed 
(blue) line. 



adjusting (3 to the 68% confidence interval. In so doing, 
we obtain the dot-dashed (blue) curves, which in fact are 
not too bad in both cases. These values of N p i X should 
be compared to the values of v = 16 and 41 found pre- 
viously by adjusting to two confidence intervals. Thus, 
we see that the effective number of degrees-of-freedom 
describing these Saskatoon likelihood functions is indeed 
v < N p i x , as we expected from the above discussion. 

Finally, the 3-dot-dashed (green) curves show "2- 
winged" Gaussians with separate positive- and negative- 
going variances, sometimes employed in traditional x 2 ~ 
analyses. This is also a fare representation of the two like- 
lihood functions, although the proposed ansatz does per- 
form slightly better. We will return to this point, but we 
should not be too surprised that the Gaussian works rea- 
sonably well when, as here, v becomes large (all the same, 
notice that the curves are not symmetric and that a sin- 
gle Gaussian, with a single a, would not fare particularly 
well). 

Comparison to the MAX experiment is shown in Fig- 
ure 4 for the region ID (experimental details can be found 
in Tanaka et al. 1996); we have combined all three fre- 
quency channels to construct the complete likelihood func- 
tion. The scan strategy consisted in taking iVpi x = 21 
single differences aligned along a unidimensional scan. 
Once again, the approximation, applied using Eqs. ( |l9| ) 
and (|20j) , supplies an excellent representation of the like- 
lihood function, down to values well below "3<r" (0.01 of 



the peak). The effective number of degrees-of-freedom is 
v = 8.5, demonstrating again that v < N plx . Here, the 
difference is rather large, due to the significant overlap 
between adjacent pixels along the scan, and we see that 
the ansatz with v = A^ p i X does not produce a good ap- 
proximation. 

Could there a way to proceed if only one confidence in- 
terval is given? This would require a choice for one of the 
parameters, say based on some knowledge of the scan 
strategy. We have just seen that for MAX v = N plx leads 
to a bad representation of the likelihood function. One 
might be tempted to try instead v — (scan length) /(beam 
FWHM)= 8.8 , which is in fact very close to the best value 
of v found from adjusting to two confidence intervals. Al- 
though this is successful in this case, it is nevertheless 
guess-work, the problem being that it is really not clear if 
there is a unique rule for judiciously choosing v. For Saska- 
toon, v — Npi X worked reasonably well, while here it does 
not, something much less being required because of the 
significant redundancy in the scan. We have found that it 
is difficult to justify a priori a general rule for choosing v 
when lacking two confidence intervals. The most sure way 
of finding the effective number of degrees-of-freedom to 
be used in the ansatz remains the use of two confidence 
intervals, via Eqs. (|l|) and (|o|). 

A noteworthy aspect of this MAX likelihood function 
is its asymmetry, i.e., it is manifestly non-Gaussian. Even 
a "2-winged" Gaussian is clearly a very bad representa- 
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Fig. 5. Comparison to a Saskatoon bin consisting of the 10-, 11- and 12-point differences (bands) from the K-band 
1994, Q-band 1994 and Q-band 1995 CAP data. As previously, the solid black line shows the true likelihood function, 
while the dashed (red) curve displays the approximation based on two confidence intervals. This Figure demonstrates 
that the approximation works well even when several individual bands are combined to form a band-power likelihood. 



tion. As the number of statistically independent elements 
entering the power estimation increases, we should expect 
the likelihood function to approach a Gaussian distribu- 
tion. The question is, what is meant by statistically in- 
dependent elements! It is obviously not something like 
2Z + 1, for MAX covers multipoles near 100; rather, as we 
have argued above, it is really the parameter v which mea- 
sures this, what we have been calling the effective number 
of degrees-of-freedom. The fact that v < iV p j x tells us 
that the number of generalized pixels is an upper limit 
to this number degrees-of-freedom determining the non- 
Gaussian nature of the likelihood function. We make the 
connection to the familiar 21 + 1-rule only when we have 
full-sky coverage and bands consisting of single multi- 
poles; then, the number of generalized pixels defining each 
(single multipole) band corresponds to 21 + 1. In the gen- 
eral case, it is more useful and correct to reason with the 
number of pixels (really, v). We may also conclude from 
this that although experiments with relatively large sky 
coverage should provide Gaussian likelihood functions on 
scales much smaller than the survey area, band-power es- 
timates on scales approaching the survey area will always 
be non-Gaussian. The proposed ansatz represents a sub- 
stantial improvement over either a single or "2-winged" 
Gaussian in such cases. 



These comparisons focus on simple cases where the 
power over a single band defined by the observing strat- 
egy is to be estimated, although in the MAX case the 
analysis did include three frequency channels simultane- 
ously. A more subtle test of the approximation is its ex- 
tension to a power estimate over several bands defined 
by different window functions. Such is the situation pre- 
sented by the five standard Saskatoon power bins. Each 
bin comprises several bands, of the type considered above, 
and the bin power is estimated using the joint likelihood 
of the contributing bands, including all band-band corre- 
lations. One could worry that the information carried by 
several bands might not be adequately incorporated by 
the two parameters of the ansatz. 

In Figure 5 we compare the approximation to the like- 
lihood function of a combination of 10-, 11- and 12-point 
differences. Included are the K-band 1994 and Q-band 
1994 and 1995 CAP data. The true likelihood function for 
this bin is calculated from the complete covariance matrix 
accounting for all correlations, and the approximation was 
fit using two confidence intervals. Even in this more com- 
plicated situation we see that the ansatz continues to work 
quite well, once the appropriate best power estimate and 
errors for the complete bin are used to find v and /?. 

It is on the basis of such comparisons that we believe 
the proposed ansatz and method of application produces 
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acceptable likelihood functions. Besides the comparisons 
shown here, we have also tested the approximation against 
11 other complete likelihood functions, all kindly provided 
by K. Ganga; these comparisons may be viewed on our 
web pageP|. The approximation works well in all cases. 
We emphasize again that the particular value of the pro- 
posed ansatz resides in its simplicity - we obtain very good 
approximations with little effort. 



4. Conclusion 

Study of CMB temperature fluctuations have over the 
short interval of time since their discovery become the cos- 
mological tool with the greatest potential for determining 
the values of the fundamental cosmological constants. The 
present data set is already capable of eliminating some re- 
gions of parameter space, and this is only a fore-taste of 
what is to come. Experimental results are often quoted as 
band-power estimates, and for Gaussian sky fluctuations, 
these represent a complete description of an observation. 
Because there arc far fewer band-powers than pixel values 
for any given experiment, the reduction to band-powers 
has been called "radical compression" (Bond et al. 1998); 
and as the number of pixels explodes with the next in- 
strument generations, this kind of compression will be- 
come increasingly important in any systematic analysis of 
parameter constraints. 

For these reasons, it is extremely useful to develop sta- 
tistical methods which take as their input power estimates. 
Since most standard methods use as a starting point the 
likelihood function, one would like to have a simple expres- 
sion for this quantity given a power estimate - one that 
does not require manipulation of the entire observational 
pixel set. One difficulty is that even for Gaussian sky fluc- 
tuations, the band-power likelihood function is not Gaus- 
sian, most fundamentally because the power represents an 
estimate of the variance of the pixel values. For any fidu- 
cial model, the data covariance matrix can be diagonalized 
and the likelihood function near this point in parameter 
space expressed as a product of individual Gaussians in 
the data elements (this is strictly speaking only possi- 
ble for the model in question). This consideration lead 
us to examine the ideal situation where the eigenvalues 
of C were all identical, for which we can analytically find 
the exact form of the likelihood function in terms of the 
best power estimate. Using this as motivation, we have 
proposed the same functional form for band-power like- 
lihood functions, Eq. (18), as an ansatz in more general 
cases. It contains two free parameters, v and f3, which 
may be uniquely determined if two confidence intervals 
of the full likelihood function (the thing one is trying to 
fit) are known; for example, the 68% and 95% confidence 
intervals (Eqs. 19 and ^). We have seen that the result- 
ing approximate distributions match remarkably well the 



complete likelihood functions for a number of experiments 
- those discussed here as well as 11 others (calculated by 
K. Ganga and B. Ratra). All of these comparisons may be 
viewed at our web siteP], where we also plan to provide 
and continually up-date the appropriate parameter values 
v and (3 for each published experiment. 

Although at least one confidence interval is normally 
given in the literature (usually at 68%), a second con- 
fidence interval is rarely quoted. To aid the kind of ap- 
proach proposed here, we would ask that in the future ex- 
perimental band-power estimates be given with at least 
two likelihood-based confidence intervals (additional in- 
tervals, such as 99.8%, would allow one to fit other func- 
tional forms with 3 free parameters). This remains the 
surest way of finding the effective number of degrees-of- 
frccdom of the likelihood, v. An otherwise a priori choice 
for this number appears difficult, among other things be- 
cause it depends on the nature of the scan strategy. We 
have noted in this light that v < N p - lx , precisely because 
of correlations between pixels, which depend on the scan 
geometry. 

One important aspect of the approximate nature of 
the proposed method is its inability to account for correla- 
tions between several band-powers. When analyzing a set 
of band-powers, one is obliged to simply multiply together 
their respective approximate likelihood functions. The ac- 
curacy of the approximation is thus limited by the extent 
to which inter-band correlations are important. Although 
one's desire is to give experimental results as independent 
power estimates, this is not always possible. Furthermore, 
and as discussed in Douspis et al. (2000), the very use 
of flat-band powers may lead to a loss of relevant exper- 
imental information otherwise contained in the original 
pixel data. The accuracy of any method based on their 
use is thus additionally limited by the importance of this 
lost information. These limitations define in practice the 
approximate nature of the proposed method. 

Another important point to make is that the approx- 
imation is extremely easy to use, as easy as the (inap- 
propriate) x 2 method; and for experiments with a small 
number of significant degrees-of-freedom, it represents a 
substantial improvement over the latter. This is the case, 
for example, with the MAX ID likelihood function, and 
it will always be the case when estimating power on the 
largest scales of a survey. When the effective number of 
degrees-of-freedom becomes large, a Gaussian becomes an 
acceptable approximation, and the gain in using the pro- 
posed ansatz is less significant. Nevertheless, the approxi- 
mation's facile applicability promotes its use even in these 
cases. In the future, we will apply the proposed approxi- 
mation in a systematic study of parameter constraints and 
for a test of the Gaussianity of the CMB fluctuations. 
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